RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information
Center (STIC) detected errors when processing the following computer readable
form:

Application Serial Number: 09/816,669

Source: OIPE

Date Processed by STIC: 10/04/2001

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS

FOR QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212.

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov

PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW:



Checker Version 3.0 

The Checker Version 3.0 application is a state-of-the-art Windows-based software program
employing a logical and intuitive user interface to check whether a sequence listing is in
compliance with format and content rules. Checker Version 3.0 works for sequence listings
generated for the original version of 37 CFR §§ 1.821 - 1.825 effective October 1, 1990 (old
rules) and the revised version (new rules) effective July 1, 1998, as well as World Intellectual
Property Organization (WIPO) Standard ST.25.

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K-compliant.
Checker allows public users to check sequence listings in Computer Readable Form
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO).
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence
listings, thus saving time and money.

Checker Version 3.0 can be downloaded from the USPTO website at the following address:

http://www.uspto.gov/web/offices/pac/checker



Raw Sequence Listing Error Summary

ERROR DETECTED

ATTN: NEW RULES CASES:

1   Wrapped Nucleics / Wrapped Aminos
2   Invalid Line Length
3   Misaligned Amino Numbering
4   Non-ASCII
5   Variable Length
6   PatentIn 2.0 "bug"
7   Skipped Sequences (OLD RULES)
8   Skipped Sequences (NEW RULES)
9   Use of n's or Xaa's (NEW RULES)
10  Invalid <213> Response
11  Use of <220>
12  PatentIn 2.0 "bug"



SUGGESTED CORRECTION          SERIAL NUMBER:

PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO

1. The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
prevent "wrapping."

2. The rules require that a line not exceed 72 characters in length. This includes white spaces.

3. The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
use space characters, instead.
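
The spacing rule above can be illustrated with a short Python sketch. It is not part of the Checker program: the function name `format_protein_rows` and the 16-residues-per-row layout are assumptions made for illustration only. It writes each number under the first character of the matching residue, using space characters and never tabs.

```python
def format_protein_rows(residues, per_row=16):
    """Return text rows: a residue line, then a numbering line, per chunk.

    Hypothetical helper. Each three-letter code plus one space occupies
    four columns, so residue k of a row starts at column 4 * (k - 1).
    """
    rows = []
    for start in range(0, len(residues), per_row):
        chunk = residues[start:start + per_row]
        rows.append(" ".join(chunk))
        # Numbering line built from spaces only.
        numbers = [" "] * (4 * len(chunk))
        for offset in range(len(chunk)):
            position = start + offset + 1
            if position == 1 or position % 5 == 0:
                label = str(position)
                col = 4 * offset
                numbers[col:col + len(label)] = label
        rows.append("".join(numbers).rstrip())
    return rows


if __name__ == "__main__":
    for row in format_protein_rows(["Met", "Ala", "Thr", "Pro", "Pro", "Lys"]):
        print(row)
```

Opening the result in a fixed-width font should show "5" directly under the fifth residue.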

4. The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
ensure your subsequent submission is saved in ASCII text.
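
A submitter could screen a file for the line-length, tab, and non-ASCII problems described above before mailing a diskette. The following Python sketch is only an illustration under stated assumptions: `check_listing_bytes` is a hypothetical name, and the real Checker program applies many more rules than these three.

```python
MAX_LINE_LENGTH = 72  # limit stated in the report; white space counts


def check_listing_bytes(data: bytes):
    """Return (line number, problem) pairs for three simple checks.

    Hypothetical helper, not the USPTO Checker: it flags lines longer
    than 72 characters, tab characters, and bytes outside 7-bit ASCII.
    """
    problems = []
    for number, raw in enumerate(data.splitlines(), start=1):
        if len(raw) > MAX_LINE_LENGTH:
            problems.append((number, "line longer than 72 characters"))
        if b"\t" in raw:
            problems.append((number, "tab character"))
        if any(byte > 127 for byte in raw):
            problems.append((number, "non-ASCII byte"))
    return problems


if __name__ == "__main__":
    sample = b"a" * 73 + b"\r\n" + b"1\t5\r\n" + b"ok\r\n"
    for number, problem in check_listing_bytes(sample):
        print(number, problem)
```

Reading the file as bytes rather than text avoids any decoding step that could hide a non-ASCII character.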


5. Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
each n or Xaa can only represent a single residue. Please present the maximum number of each
residue having variable length and indicate in the <220>-<223> section that some may be missing.

6. A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
sequence(s). Normally, PatentIn would automatically generate this section from the
previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
Artificial or Unknown sequences.

7. Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
This sequence is intentionally skipped

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8. Sequence(s) missing. If intentional, please insert the following lines for each skipped sequence:

<210> sequence id number
<400> sequence id number
000
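
As an illustration of the new-rules placeholder above, the three lines can be generated for a given SEQ ID NO. This is a minimal Python sketch; the function name `skipped_sequence_lines` is hypothetical and the output simply mirrors the three lines shown in the report.

```python
def skipped_sequence_lines(seq_id: int):
    """Return the three placeholder lines for an intentionally skipped sequence.

    Hypothetical helper following the <210> / <400> / 000 pattern above.
    """
    return [f"<210> {seq_id}", f"<400> {seq_id}", "000"]


if __name__ == "__main__":
    print("\n".join(skipped_sequence_lines(7)))
```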

9. Use of n's and/or Xaa's has been detected in the Sequence Listing.

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.
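
To explain the location of each n in the <220> to <223> section, its position in the sequence must be known. The Python sketch below shows one way to find those positions; the names `n_positions` and `needs_feature_section` are hypothetical, and the sample sequence is invented for illustration rather than taken from the application.

```python
def n_positions(sequence_text: str):
    """Return 1-based positions of 'n' in a nucleotide sequence.

    Spaces, line breaks, and the running totals printed at the end of
    each listing line are ignored; only letters are counted as bases.
    """
    bases = "".join(ch for ch in sequence_text.lower() if ch.isalpha())
    return [i for i, base in enumerate(bases, start=1) if base == "n"]


def needs_feature_section(sequence_text: str, has_220_223: bool) -> bool:
    """True when n is present but no <220>-<223> section was supplied."""
    return bool(n_positions(sequence_text)) and not has_220_223


if __name__ == "__main__":
    sample = "acgtn acgta 10\nnacg 14"  # invented example
    print(n_positions(sample))
```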

10. Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
is Artificial Sequence.
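
The three kinds of <213> response listed above can be told apart with a rough pattern check. The Python sketch below is a heuristic under stated assumptions, not the rule the Office applies: `classify_organism` is a hypothetical name, and the "Genus species" pattern only tests capitalization and spacing, so it cannot confirm that a name is a real organism.

```python
import re

SPECIAL_RESPONSES = {"Unknown", "Artificial Sequence"}
# Heuristic only: capitalized genus followed by a lower-case species epithet.
BINOMIAL = re.compile(r"[A-Z][a-z]+ [a-z][a-z-]+")


def classify_organism(response: str) -> str:
    """Classify a <213> response into one of three illustrative buckets."""
    text = response.strip()
    if text in SPECIAL_RESPONSES:
        return "needs <220>-<223>"
    if BINOMIAL.fullmatch(text):
        return "scientific name"
    return "not recognized"


if __name__ == "__main__":
    for value in ("Artificial Sequence", "Homo sapiens", "human"):
        print(value, "->", classify_organism(value))
```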

11. Sequence(s) missing the <220> "Feature" and associated numeric identifiers and responses.

Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
"Unknown." Please explain source of genetic material in <220> to <223> section.

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12. Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.



13. Misuse of n: n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not
used to represent any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - 08/21/2001




RAW SEQUENCE LISTING DATE: 10/04/2001 

PATENT APPLICATION: US/09/816,669 TIME: 17:20:41 

Input Set : A:\PTO_VSK.txt 

Output Set: N:\CRF3\10042001\I816669.raw 

3 <110> APPLICANT: GARABEDIAN, Michael 

4 TANEJA, Samir 

5 HITTELMAN, Adam 

6 MARKUS, Steven 

8 <120> TITLE OF INVENTION: METHOD FOR SCREENING TRANSCRIPTIONAL COREGULATORY PROTEINS
9 TRANSCRIPTION FACTORS, AND ANDROGEN RECEPTOR TRANSCRIPTIONAL COREGULATORY
10 PROTEINS AS TARGETS FOR ANDROGEN RECEPTOR-DEPENDENT DISEASES

12 <130> FILE REFERENCE: GARABEDIAN=1.1A

14 <140> CURRENT APPLICATION NUMBER: US/09/816,669 

15 <141> CURRENT FILING DATE: 2001-03-26
17 <150> PRIOR APPLICATION NUMBER: 60/225,618
18 <151> PRIOR FILING DATE: 2000-08-15
20 <150> PRIOR APPLICATION NUMBER: 60/191,768
21 <151> PRIOR FILING DATE: 2000-03-24
23 <160> NUMBER OF SEQ ID NOS: 20
25 <170> SOFTWARE: PatentIn version 3.0

Does Not Comply
Corrected Diskette Needed

27 <210> SEQ ID NO: 1

28 <211> LENGTH: 474 

29 <212> TYPE: DNA 

30 <213> ORGANISM: human 

32 <400> SEQUENCE: 1

33 atggcgacgc cccctaagcg gcgggcggtg gaggccacgg gggagaaagt gctgcgctac 60 
35 gagaccttca tcagtgacgt gctgcagcgg gacttgcgaa aggtgctgga ccatcgagac 120 
37 aaggtatatg agcagctggc caaatacctt caactgagaa atgtcattga gcgactccag 180 
39 gaagctaagc actcggagtt atatatgcag gtggatttgg gctgtaactt cttcgttgac 240 
41 acagtggtcc cagatacttc acgcatctat gtggccctgg gatatggttt tttcctggag 300 
43 ttgacactgg cagaagctct caagttcatt gatcgtaaga gctctctcct cacagagctc 360 
45 agcaacagcc tcaccaagga ctccatgaat atcaaagccc atatccacat gttgctagag 420 
47 gggcttagag aactacaagg cctgcagaat ttcccagaga agcctcacca ttga 474

50 <210> SEQ ID NO: 2 

51 <211> LENGTH: 157 

52 <212> TYPE: PRT 

53 <213> ORGANISM: human 
55 <400> SEQUENCE: 2 



57 Met Ala Thr Pro Pro Lys Arg Arg Ala Val Glu Ala Thr Gly Glu Lys
58 1               5                   10                  15
60 Val Leu Arg Tyr Glu Thr Phe Ile Ser Asp Val Leu Gln Arg Asp Leu
61             20                  25                  30
63 Arg Lys Val Leu Asp His Arg Asp Lys Val Tyr Glu Gln Leu Ala Lys
64         35                  40                  45
66 Tyr Leu Gln Leu Arg Asn Val Ile Glu Arg Leu Gln Glu Ala Lys His
67     50                  55                  60
69 Ser Glu Leu Tyr Met Gln Val Asp Leu Gly Cys Asn Phe Phe Val Asp
70 65                  70                  75                  80
72 Thr Val Val Pro Asp Thr Ser Arg Ile Tyr Val Ala Leu Gly Tyr Gly
73                 85                  90                  95
75 Phe Phe Leu Glu Leu Thr Leu Ala Glu Ala Leu Lys Phe Ile Asp Arg


76             100                 105                 110
78 Lys Ser Ser Leu Leu Thr Glu Leu Ser Asn Ser Leu Thr Lys Asp Ser
79         115                 120                 125
81 Met Asn Ile Lys Ala His Ile His Met Leu Leu Glu Gly Leu Arg Glu
82     130                 135                 140
84 Leu Gln Gly Leu Gln Asn Phe Pro Glu Lys Pro His His
85 145                 150                 155
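
A protein listing such as SEQ ID NO: 2 can be cross-checked against its coding DNA by translating the codons. The Python sketch below assumes the standard genetic code and a reading frame that starts at the first base; the names `CODON_TABLE` and `translate` are illustrative, and this is not how PatentIn or Checker is known to work. Characters other than a, c, g, t are dropped, so a sequence containing n would need separate handling.

```python
from itertools import product

BASES = "tcag"
# Standard genetic code, codons ordered ttt, ttc, tta, ttg, tct, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str):
    """Translate DNA text to three-letter codes, stopping at a stop codon."""
    bases = "".join(ch for ch in dna.lower() if ch in "acgt")
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino = CODON_TABLE[bases[i:i + 3]]
        if amino == "*":
            break
        residues.append(THREE_LETTER[amino])
    return residues


if __name__ == "__main__":
    # First line of SEQ ID NO: 1 as printed in the raw listing.
    line = ("atggcgacgc cccctaagcg gcgggcggtg "
            "gaggccacgg gggagaaagt gctgcgctac 60")
    print(" ".join(translate(line)))
```

The first sixteen residues produced from that line can be compared by eye with the first row of SEQ ID NO: 2.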

87 <210> SEQ ID NO: 3 

88 <211> LENGTH: 1097 

89 <212> TYPE: DNA 

90 <213> ORGANISM: human 

92 <400> SEQUENCE: 3

93 aaatgcacaa cccggacgga agtgcctctc cgacagcaga tccaggctcg gagctccaga 60 
95 cgctgggaca ggccgcccgc agaccacccc cgccgcgcgc gggacacgac gccccccgca 120 
97 ggacacgccc atcagcccgg aaacccctga gctgcttctc ccggaggccg atgcccaccc 180 
99 gggagccccc aaagactcgc ggctcccggg ggcacctgca tactcacccg cctgggcctg 240 
101 ggcccccgct gcagggactg gcgccccgag gcctcaaaac cagcgccccc cgccctccgt 300 
103 gccagcccca gccgggaccc cacaaggcaa agaccaagaa gattgtgttt gaggatgagt 360 
105 tgctctccca ggccctcctg ggcgccaaga agcctattgg agccatccct aaggggcata 420 
107 agcctaggcc ccacccagtg cccgactatg agcttaagta cccgccagtg agcagtgaga 480
109 gggaacggag ccgctatgtc gcagtgttcc aggaccagta cggagagttc ttggagctcc 540 
111 agcacgaggt ggggtgtgca caggcaaagc tcaggcagct ggaggccctg ctgagctccc 600 
113 tgcccccacc ccaaagccag aaggaggccc aagttgcagc ccgggtttgg agggagtttg 660 
115 agatgaagcg aatggatcct ggcttcctgg acaagcaggc tcgctgccac tacctgaagg 720 
117 gtaaactgag gcatctcaag actcagatcc agaaattcga tgaccaagga gacagcgagg 780 
119 gctccgtgta cttctaagtg cccctgcaga tgggcagagg gatgcatggg gatgcaggtc 840 
121 ccttgcattt cttggtatct ctcagctttt cctcttgcag ctccccctac caggggtcgc 900 
123 tttctcctgg attgcaaatg cctcttcagt ttggactcag ctctgacagc ccctcctcca 960 
125 ggaaggcctt ccaggacttc ctcctctggg tcctctagct ctgaccctac agggactcca 1020 
127 gatctcaacc tgttccctgg aagtagggcc tgctctccat cccagtgaaa taaacatgta 1080 
129 ttagacacct aaaaaaa 1097 

132 <210> SEQ ID NO: 4 

133 <211> LENGTH: 264 

134 <212> TYPE: PRT 

135 <213> ORGANISM: Human 
137 <400> SEQUENCE: 4 

139 Met His Asn Pro Asp Gly Ser Ala Ser Pro Thr Ala Asp Pro Gly Ser
140 1               5                   10                  15
142 Glu Leu Gln Thr Leu Gly Gln Ala Ala Arg Arg Pro Pro Pro Pro Arg
143             20                  25                  30
145 Ala Gly His Asp Ala Pro Arg Arg Thr Arg Pro Ser Ala Arg Lys Pro
146         35                  40                  45
148 Leu Ser Cys Phe Ser Arg Arg Pro Met Pro Thr Arg Glu Pro Pro Lys
149     50                  55                  60
151 Thr Arg Gly Ser Arg Gly His Leu His Thr His Pro Pro Gly Pro Gly
152 65                  70                  75                  80
154 Pro Pro Leu Gln Gly Leu Ala Pro Arg Gly Leu Lys Thr Ser Ala Pro
155                 85                  90                  95
157 Arg Pro Pro Cys Gln Pro Gln Pro Gly Pro His Lys Ala Lys Thr Lys

158             100                 105                 110
160 Lys Ile Val Phe Glu Asp Glu Leu Leu Ser Gln Ala Leu Leu Gly Ala
161         115                 120                 125
163 Lys Lys Pro Ile Gly Ala Ile Pro Lys Gly His Lys Pro Arg Pro His
164     130                 135                 140
166 Pro Val Pro Asp Tyr Glu Leu Lys Tyr Pro Pro Val Ser Ser Glu Arg
167 145                 150                 155                 160
169 Glu Arg Ser Arg Tyr Val Ala Val Phe Gln Asp Gln Tyr Gly Glu Phe
170                 165                 170                 175
172 Leu Glu Leu Gln His Glu Val Gly Cys Ala Gln Ala Lys Leu Arg Gln
173             180                 185                 190
175 Leu Glu Ala Leu Leu Ser Ser Leu Pro Pro Pro Gln Ser Gln Lys Glu
176         195                 200                 205
178 Ala Gln Val Ala Ala Arg Val Trp Arg Glu Phe Glu Met Lys Arg Met
179     210                 215                 220
181 Asp Pro Gly Phe Leu Asp Lys Gln Ala Arg Cys His Tyr Leu Lys Gly
182 225                 230                 235                 240
184 Lys Leu Arg His Leu Lys Thr Gln Ile Gln Lys Phe Asp Asp Gln Gly
185                 245                 250                 255
187 Asp Ser Glu Gly Ser Val Tyr Phe
188             260

190 <210> SEQ ID NO: 5 

191 <211> LENGTH: 517 

192 <212> TYPE: DNA 

193 <213> ORGANISM: Human 

195 <220> FEATURE: 

196 <221> NAME/KEY: misc_feature 



197 <223> OTHER INFORMATION: n at position 65 is unknown.



200 <400> SEQUENCE: 5 

201 gaacggcacg agggcgcgcc acgcgcggga agcggcgcgc ggagcgcgcg cggcgggccg 60 
W--> 203 cgcamgcgag ggagccgagc gcccgmacgc gcccgagcgg acasacgcca gagccgcgcc 120 

205 ccggtfccgag cgcagcgcgc cggccgssyg ggccgccagg ggcgcgcgcg gcggagcgcg 180 
207 gggcgcgmga aaaggggccc ggcggagacc aagggcaggc gcggcccgca agggcgccgg 240 
209 ggaaggcgcc cggcaaggag gcggacaagc ggagcaggcc aacgagacgc gcgcacccac 300 
211 acacgagcgc gagccgccac aacaccacac ccggcccaag gagaacagca cgccaacgcg 360 
213 ccagycacgg cgggcacggg aggcgggcca cacacagcgg ccccgccaag gcacggcgca 420 
215 cggcacaagg gcaccacgcc agacaagcga ggaggcagca cgccgagacc ggccggaggg 480
217 ccgcgaccgc cggagaaaag gaacagagag cccccca 517 

220 <210> SEQ ID NO: 6 

221 <211> LENGTH: 189 

222 <212> TYPE: PRT 

223 <213> ORGANISM: Human 
225 <400> SEQUENCE: 6 

227 Glu Phe Gly Thr Arg Ala Arg Phe Thr Arg Gly Lys Ser Ala Leu Leu
228 1               5                   10                  15
230 Glu Arg Ala Leu Ala Arg Pro Arg Thr Glu Val Ser Leu Ser Ala Phe
231             20                  25                  30
233 Ala Leu Leu Ser Pro Ser Trp Tyr Ser Thr Ala Arg Ala Val Phe Ser
234         35                  40                  45






236 Val Ala Glu Leu Gln Ser Arg Leu Ala Ala Leu Gly Arg Gln Val Gly
237     50                  55                  60
239 Ala Arg Val Leu Asp Ala Leu Val Ala Arg Glu Lys Gly Ala Arg Arg
240 65                  70                  75                  80
242 Glu Thr Lys Val Leu Gly Ala Leu Leu Phe Val Lys Gly Ala Val Trp
243                 85                  90                  95
245 Lys Ala Leu Phe Gly Lys Glu Ala Asp Lys Leu Glu Gln Ala Asn Asp
246             100                 105                 110
248 Asp Ala Arg Thr Phe Tyr Ile Ile Glu Arg Glu Pro Leu Ile Asn Thr
249         115                 120                 125
251 Tyr Ile Ser Val Pro Lys Glu Asn Ser Thr Leu Asn Cys Ala Ser Phe
252     130                 135                 140
254 Thr Ala Gly Ile Val Glu Ala Val Leu Thr His Ser Gly Phe Pro Ala
255 145                 150                 155                 160
257 Lys Val Thr Ala His Trp His Lys Gly Thr Thr Leu Met Ile Lys Phe
258                 165                 170                 175
260 Glu Glu Ala Val Ile Ala Arg Asp Arg Leu Glu Gly Arg
261             180                 185

263 <210> SEQ ID NO: 7 

264 <211> LENGTH: 126 

265 <212> TYPE: DNA 

266 <213> ORGANISM: Human 

268 <400> SEQUENCE: 7

269 gaattcggca cgaggctcaa gccctacgtg agctacctcg cccctgagag cgaggagacg 60 
271 cccctgacgg ccgcgcagct cttcagcaag ccgttggcgc cttgccatcg aaaaggactt 120 
273 caagga 126 
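
The <211> LENGTH response can be checked against the sequence lines themselves. The Python sketch below counts the bases in the three printed lines of SEQ ID NO: 7; the function name `count_bases` is hypothetical, and the approach assumes that every purely alphabetic token on a line is sequence data while numeric tokens are line numbers or running totals.

```python
def count_bases(lines):
    """Count nucleotide letters, ignoring numeric tokens on each line."""
    total = 0
    for line in lines:
        for token in line.split():
            if token.isalpha():
                total += len(token)
    return total


if __name__ == "__main__":
    seq7 = [
        "269 gaattcggca cgaggctcaa gccctacgtg agctacctcg cccctgagag cgaggagacg 60",
        "271 cccctgacgg ccgcgcagct cttcagcaag ccgttggcgc cttgccatcg aaaaggactt 120",
        "273 caagga 126",
    ]
    declared_length = 126  # from the <211> line of SEQ ID NO: 7
    print(count_bases(seq7) == declared_length)
```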

276 <210> SEQ ID NO: 8 

277 <211> LENGTH: 42 

278 <212> TYPE: PRT 

279 <213> ORGANISM: Human 
281 <400> SEQUENCE: 8 

283 Glu Phe Gly Thr Arg Leu Lys Pro Tyr Val Ser Tyr Leu Ala Pro Glu
284 1               5                   10                  15
286 Ser Glu Glu Thr Pro Leu Thr Ala Ala Gln Leu Phe Ser Lys Pro Leu
287             20                  25                  30
289 Ala Pro Cys His Arg Lys Gly Leu Gln Gly
290         35                  40

292 <210> SEQ ID NO: 9 

293 <211> LENGTH: 678 

294 <212> TYPE: DNA 

295 <213> ORGANISM: Human 

297 <220> FEATURE:

298 <221> NAME/KEY: misc_feature

299 <223> OTHER INFORMATION: n at position 651 is unknown.

302 <400> SEQUENCE: 9

303 gaattcggca cgaggattca ttgcccccac aaicctaggc ctacccgccg cagtactgat 60 
305 cattctattt ccccctctat tgatccccac ctccaaatat ctcatcaaca accgactaat 120 
307 caccacccaa caatgactaa tcaaactaac ctcaaaacaa atgataacca tacacaacac 180 
309 taaaggacga acctgatctc ttatactagt atccttaatc atttttattg ccacaactaa 240 



The type of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 



311 cctcctcgga ctcctgcctc actcatttac accaaccacc caactatcta taaacctagc 300
313 catggccatc cccttatgag cgggcgcagt gattataggc tttcgctcta agattaaaaa 360
315 tgccctagcc cacttcttac cacaaggcac acctacaccc cttatcccca tactagttat 420
317 tatcgaaacc atcagcctac tcattcaacc aatagccctg gccgtacgcc taaccgctaa 480
319 cattactgca ggccacctac tcatgcacct aattggaagc gccaccctag caatatcaac 540
321 cattaacctt cctctacact tatcatcttc acaattctaa ttctactgac tatcctaaaa 600
W--> 323 atcgctgtcg ccttaatcca agcctacgtt ttcacacttc tagtaagcct if^actc leg 660
325 acaacacata aaaaaaaa 678
328 <210> SEQ ID NO: 10
329 <211> LENGTH: 60
330 <212> TYPE: PRT
331 <213> ORGANISM: Human
333 <400> SEQUENCE: 10
335 Glu Phe Gly Thr Arg Ile His Cys Pro His Asn Pro Arg Pro Thr Arg
336 1               5                   10                  15
338 Arg Ser Thr Asp His Ser Ile Ser Pro Ser Ile Asp Pro His Leu Gln
339             20                  25                  30
341 Ile Ser His Gln Gln Pro Thr Asn His His Pro Thr Met Thr Asn Gln
342         35                  40                  45
344 Thr Asn Leu Lys Thr Asn Asp Asn His Thr Gln His
345     50                  55                  60
347 <210> SEQ ID NO: 11
348 <211> LENGTH: 1918
349 <212> TYPE: DNA
350 <213> ORGANISM: Human
352 <400> SEQUENCE: 11
353 gaattccaat gtggtaaagt cttcgctcaa acatcacaac ttgcaaggca ttggagagtt 60
355 catactggag aaaaacctta caagtgtaat gactgtggca gagcctttag tgatcgttca 120
357 agcctaactt ttcatcaggc aatacatact ggagagaaac cttacaaatg tcatgaatgc 180
359 ggcaaggttt ttaggcacaa ttcatacctt gcaactcatc ggcgaattca tactggagag 240
361 aaaccttaca agtgtaatga gtgtgggaaa gcctttagta tgcattcaaa cctaactacc 300
363 cataaggtca tccatactgg agagaagcct tacaaatgta atcaatgtgg caaggtcttc 360
365 actcagaact cacaccttgc aaatcatcaa aggactcaca ccggagagaa accttaccga 420
367 tgcaatgagt gtgggaaagc cttcagtgtt cgttcaagcc taaccaccca tcaggcaatc 480
369 catactggga aaaaacctta caaatgtaat gaatgtggca aggtctttac tcaaaatgct 540
371 cacctggcaa atcaccgaag aattcatact ggggagaaac cttacaggtg tacagagtgt 600
373 gggaaagcct ttagggtaag atcaagtcta actacccata tggcaatcca cactggagaa 660
375 aagcgttaca aatgtaatga gtgtggcaag gtcttcaggc agagttcaaa tcttgcaagt 720
377 catcacagaa tgcataccgg agagaaacct tacaaatgag tgtggtgagg tcattaggta 780
379 caattcactc ctttcacatc agttaatttc attcttgaca gaatccttac aaatgtagtg 840
381 acagtggcca atccctcatg agttgaagca ttaatagata tgagaggcca taagcaagag 900
383 acatcatgta aacatatgtg gcagagggtc tatccaggcc tcgcaggtta ctaggcatca 960
385 agatttatat ctttgatgaa acgaaacaaa tgtaatatgc atcctgaggc cattacccag 1020
387 tgaccgatgg taagtgagga ttcctaggag gaataacagt ctctggtttc cctgtttgcc 1080
389 tttgatatta tacactgtag aatactcaca agtccaaata tgctaaaaat tatatatttt 1140
391 taactcacat acgaaaaggt tgcaggatat ttgtaggcag tcagttacct tcaccttatg 1200
393 aaatgtttca ctgagttatt tgaggttttt tggaaagcct actattgcgt ttcaatgtga 1260
395 actttgaaat cttattgtgc atccttacac accttccatg gtgctttctt ggaaagatca 1320
397 ttgggatgga aggatcattg attgggtgaa gatcattgat taggtgaagg attatttcta 1380
VERIFICATION SUMMARY

PATENT APPLICATION: US/09/816,669 



DATE: 10/04/2001 
TIME: 17:20:43 



Input Set : A:\PTO_VSK.txt 

Output Set: N:\CRF3\10042001\I816669.raw 



L:14 M:270 C: Current Application Number differs, Replaced Current Application Number 
L:203 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID# : 5 
L:203 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 5

L:323 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID# : 9 
L:323 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 9 

L:1324 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:15 
L:1336 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:16 
L:1348 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:17 
L:1360 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:18 
L:1372 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:19 
L:1384 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:20 
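
The summary lines above share a regular shape, so they can be split into fields for sorting or counting. The Python sketch below is an illustration only: the report does not define the `L:`, `M:`, and `W:`/`C:` fields, so the field names used here (`line`, `m_code`, `flag`, `text`) are assumptions, as are the names `Finding` and `parse_summary`.

```python
import re
from typing import NamedTuple


class Finding(NamedTuple):
    line: int      # number after "L:" (assumed to be the listing line)
    m_code: int    # number after "M:" (meaning not stated in the report)
    flag: str      # single letter before the message, e.g. "W" or "C"
    text: str      # remainder of the message


PATTERN = re.compile(r"L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)")


def parse_summary(lines):
    """Parse verification summary lines, skipping lines that do not match."""
    findings = []
    for raw in lines:
        match = PATTERN.match(raw.strip())
        if match:
            line, m_code, flag, text = match.groups()
            findings.append(Finding(int(line), int(m_code), flag, text))
    return findings


if __name__ == "__main__":
    sample = [
        "L:203 M:258 W: Mandatory Feature missing, <222> not found for SEQ ID# : 5",
        "L:1324 M:220 C: Keyword misspelled or invalid format, <213> ORGANISM for SEQ ID#:15",
    ]
    for finding in parse_summary(sample):
        print(finding.line, finding.flag, finding.text)
```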